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(54) A fusion protein and its use for the simultaneous detection of autoantibodies related to 
insulin-dependent diabetes mellitus 

(57) The invention relates to a fusion protein having 
epitopes of at least two of the autoantigens glutamic 
acid decarboxylase (GAD65), islet ceil antigen (IA2) 
and preproinsulin (PPINS) wherein said epitopes are 
connected with a linker peptide. The fusion protein must 
be able to bind to a solid phase. 

The invention also concerns the cDNA, and a vec- 
tor and cell comprising said cDNA. Furthermore, this 
invention relates to the use of said fusion protein in an 
immunoassay for the simultaneous detection of autoan- 
tibodies related to insulin -dependent diabetes mellitus. 
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Description 

FIELD OF THE INVENTION 

5 [0001] This invention relates to a new fusion protein, its cDNA, and a vector and a cell comprising said cDNA. Fur- 
thermore, this invention relates to the use of said fusion protein in an immunoassay for simultaneous detection of 
autoantibodies related to insulin dependent diabetes mellitus. 

BACKGROUND OF THE INVENTION 

10 

[0002] The publications and other materials used herein to illuminate the background of the invention, and in partic- 
ular, cases to provide additional details respecting the practice, are incorporated by reference. 

[0003] GAD65, IA2 and insulin are pancreatic proteins produced by the beta cells (for review see Atkinson and 
Maclaren 1993). Autoantibodies to these proteins are detected in patients with insulin-dependent diabetes mellitus 

is (ID DM) and healthy individuals at risk for developing the disease. More than 80 % of newly-diagnosed IDDM patients 
have antibodies against at least one of these proteins (Baekkeskov et al. 1982). The risk of diabetes in relatives of 
IDDM patients increases markedly when the number of autoantibodies detected in the serum increases (Bingley et al. 
1994; Verge et al. 1994). In a group of high genetic risk, presence in serum of antibodies to one or more of these 
autoantigens predicted the disease onset accurately (Verge et al. 1 996). Also permanently healthy subjects (as regards 

20 IDDM) may have temporarily or permanently antibodies against one of the three antigens, but antibodies against mul- 
tiple antigens occur extremely rarely. It is therefore sought to simultaneously determine reactivity against two or ali three 
of the proteins, as the positivity for more than one of these autoantibodies remarkably increases disease risk (Bingley 
etal. 1994). 

[0004] GAD65 (Bu et al. 1992) has several epitopes recognised by autoantibodies (Falorni et al. 1996). These are 
25 located mostly at the center and C-terminus of the molecule whereas the N-terminal quarter of the molecule is thought 
to contribute to membrane docking of the protein, and to contain few if any IDDM-informative epitopes (Falorni et al. 
1996). 

[0005] IA2 (also known as ICA512) (Rabin et al. 1994) is a transmembrane protein with still unknown function. The 
intracellular part of the molecule (\A£ K , about 40 kDa) contains a domain with similarity to the active center of protein 
30 phosphatases (Fischer et al. 1991), but no enzymatic activity has been ascribed the IA2 molecule. The informative 
epitopes of IA2 reside in the cytoplasmic domain and herein they are concentrated at the C-terminal half (Lampasona 
et al. 1996; Zhang et al. 1997). 

[0006] Insulin (Bell et al. 1980) is made by pancreatic p-cells as a precursor preproinsulin which is cleaved to proin- 
sulin. The proinsulin is further processed to give the insulin consisting of A and B chains connected together with two 
35 disulphide bridges. 

[0007] More than 20% of sera collected from newly-diagnosed IDDM-patients contain insulin autoantibodies (IAA) 
(Sabbah et al. 1996). As, however, the immunity to insulin may have arisen through formation of response to prepro- or 
proinsulins (Snorgaard et al. 1996), it is relevant to use these peptides in this assay system. Tolerance to this autoanti- 
gen may be induced by oral insulin feeding in non-obese diabetic (NOD) mice (Zhang et al. 1991). 

40 [0008] In addition to linear epitopes, autoantibodies are thought to recognize important conformational epitopes 
resulting from the three-dimensional structure of the protein (Kim et al. 1993). Antigen molecules produced or assayed 
using techniques which destroy these structures are less informative as regards IDDM or prediabetes. 
[0009] Several methods for detection of autoantibodies in IDDM sera have been elaborated. One method exploits in 
vitro transcription-translation for producing radioactively labeled autoarrtigen (IA2, GAD65) (Petersen etal. 1994), while 

45 in another method biotin -labeled GAD65 is added to the patient sera and after formation of immune complexes, free 
label is detected and quantitated (Mehta et al. 1996). These methods all suffer from suboptimal niveau of informativrty, 
as they employ only one specific autoarrtigen. Moreover they have the drawbacks associated with the use of radiochem- 
icals. 

[0010] Using a protein molecule in which a combination of the epitopes from at least two but preferably three different 
50 autoantigens are represented should detect a larger panel of autoantibodies thus revealing more specifically the popu- 
lation of individuals at risk of developing the disease. 

SUMMARY OF THE INVENTION 

55 [001 1] According to one aspect, this invention relates to a new fusion protein having epitopes of at least two of the 
autoantigens glutamic acid decarboxylase (GAD 65), islet cell antigen (IA2) and preproinsulin (PPINS) wherein said 
epitopes are connected with a linker peptide, said fusion protein being able to bind to a solid phase. 
[0012] According to another aspect, the invention concerns a cDNA sequence encoding the said fusion protein. 
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[0013] According to a third aspect, the invention concerns a vector and a cell comprising said cDNA. 
[0014] According to a fourth aspect, the invention concerns an immunoassay for the simultaneous determination in a 
sample of a person's body fluid of at least two insulin -dependent diabetes mellitus (IDDM) -related autoantibodies, 
wherein each autoantibody is specific for an epitope of the autoantigens glutamic acid decarboxylase (GAD65), islet cell 
5 antigen (IA2) or preproinsulin (PPINS). The immunoassay comprises the steps of 

incubating said sample with said autoantigens or, alternatively, with the fusion protein according to this invention, 
said autoantigens or said fusion protein being bound to a solid support, 

adding at least one labeled reagent capable of binding to one or more of said autoantibodies, and 
io quantifying the signals from the labels bound to the solid phase. 

[0015] According to still one aspect, the invention concerns a method for diagnosing a person's risk of developing 
insulin-dependent diabetes mellitus (IDDM), said method comprising the determination in a sample of said person's 
body fluid of at least two insulin dependent diabetes mellitus (IDDM) -related autoantibodies specific for an epitope of 
15 the autoantigens glutamic acid decarboxylase (GAD65), islet cell antigen (IA2) or preproinsulin (PPINS), wherein the 
presence of at least two of said autoantibodies are indicative for said person's risk of developing IDDM. The order of 
appearance of these autoantibodies is used to predict the time point of onset of the disease. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 

[0016] 

Figures 1a and lb show the cDNA construct for a fusion protein according to this invention (flag peptide (SEQ ID 
NO: 1); Not I (SEQ ID NO: 2); poly-his (SEQ ID NO: 3) and Sgf I (SEQ ID NO: 4)), 

25 

Figure 2a shows the amino acid sequence of the IA2 protein SEQ ID NO: 5). 

Figure 2b shows the amino acid sequence of the GAD65 protein (SEQ ID NO: 6), 

30 Figure 2c shows the amino acid sequence of preproinsulin (PPINS) (SEQ ID NO: 7), 

Figure 3 shows the fusion protein according to this invention attached to a solid support, autoantibodies attached 
to epitopes of said protein, and labeled reagents bound to said autoantibodies, wherein the reagents are labeled 
with different labels, and 

35 

Figure 4 shows the fusion protein according to this invention attached to a solid support, autoantibodies attached 
to epitopes of said protein, and labeled reagents bound to said autoantibodies, wherein the reagents are labeled 
with the same label. 

40 [0017] The nucleotide sequence encoding GAD65 is SEQ ID NO: 8, the nucleotide sequence encoding IA2 is SEQ 
ID NO: 9 and the human insulin gene is SEQ ID NO: 10. 

DETAILED DESCRIPTION OF THE INVENTION 

45 [0018] The term "epitope" can be an amino acid sequence anything from very few (about 5 to 1 0) amino acids of the 
autoantigens up to the whole autoantigen. Preferable lengths of the epitopes are represented by the underlined amino 
acid sequences in Figures 2a and 2b, and the whole antigen sequence is disclosed in Figure 2c. Thus, the epitope of 
IA2 comprises preferably the amino acids 771-979 of the amino acid sequence shown in Figure 2a. Another preferred 
alternative is the whole intracellular domain (amino acids ranging from about 576 to 979 of the sequence in Figure 2a). 

so The epitope of GAD65 comprises preferably the amino acids 102-585 of the amino acid sequence shown in Figure 2b, 
and the epitope of PPINS comprises preferably all the amino acids 1-110 of the polypeptide shown in Figure 2c. It 
should be noted that the above mentioned specific sequences are examples only. 

[0019] According to a preferred embodiment, the fusion protein has epitopes of each of the autoantigens GAD65, IA2 
and PPINS. Such a fusion protein allows simultaneous detection of autoantibodies specific for any of said autoantigens. 
55 [0020] Said fusion protein containing epitopes of GAD65, IA2 and PPINS is formed by combining these domains via 
short peptides consisting of amino acid residues, e.g. lysine and arginine residues. 

[0021 ] The epitopes from distinct autoantigens will be linked together via short peptides containing e.g. several lysine 
residues, which allows preferential labeling of these lys-residues. For construction of the polygenic cDNA, the linker- 
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encoding cDNA contains a recognition site for a rarely cutting restriction enzyme such as Not I or Sgf I (see Figure 1a 
and 1b). 

[0022] These linker residues may be connected to a member of an affinity binding pair so as to enable the binding of 
said fusion protein to a solid phase. The bioaffinity pair may be e.g. biotin - streptavidin. The residues (lysine) can be 
5 biotinylated after which the fusion protein is attached to a streptavidin -coated solid phase. The solid phase can e.g. be 
a well of a microtitration strip or plate. Alternatively, the solid phase consists of microparticles. 

[0023] The fusion protein can alternatively be bound to the solid phase by direct adsorption. Furthermore, the fusion 
protein can be covalently linked to the solid phase. In this case the fusion protein must be provided with groups able to 
create a covalent bond with the solid phase. 
io [0024] Figures 2 and sequences SEQ ID NO: 8-10 show the amino acid sequences and the nucleotide sequences, 
respectively, of the preferred epitopes. 

[0025] The following illustrates the construction of the fusion protein and its preparation. 

[0026] The N-terminus of the hybrid protein and the single proteins will contain a flag peptide NH2-DYKDDDDK- 
COOH (SEQ ID NO: 1) with a free N-terminaJ amino group to allow recognition of the protein using M1 monoclonal antj- 
15 body (ATCC cell line nr. HB 9259). This enables detection of the protein in SDS-PAGE where not all monoclonals func- 
tion. 

[0027] At the carboxy-terminal end of the fusion protein and in the single antigens a motif X-X-G-S-H-H-H-H-H-H 
(SEQ ID NO: 1 1) is introduced to allow purification of the protein with metal chelate affinity chromatography and detec- 
tion with monoclonal antibody against this epitope (Cedarlane Laboratories Ltd, Canada). 
20 [0028] The GAD65 gene (Bu et al. 1992) is, for example, amplified with PCR (nucleotides 1311-1 755) in such a man- 
ner that 101 amino acid residues are removed from the N-terminus. 

[0029] The 3' -end oligonucleotide contains 17 bases complementary to the mRNA of GAD65 and an additional 
sequence encoding half of a peptide forming the bridge between GAD65 and IA2 domains. 
[0030] The nucleotide sequence of the bridge is for example 

25 

Not I 

GAD 6 5 - AAG AAG AAGCGGCCGCG AAAG AAG AAG - 1 A 2 (SEQ ID NO: 12? amino 
30 acid sequence of the peptide KKKRPRKKK (SEQ ID NO: 2)), or 



Sgf I 

GAD65-AAGAAGA AGCGATCGC GAAAGAAGAAG-IA2 (SEQ ID NO : 1 3 ; amino 

acid sequence KKKRSRKKK (SEQ ID NO: 4)). The restriction enzyme recognition sites are underlined in the middle. 
The fragments are made from a plasmid harbouring said cDNAs with PCR and digested with appropriate restriction 
enzymes (e.g. Not I or Sgf I) and cloned into appropriate vectors. The GAD65 part is linked to IA2 and this to PPINS, 

40 using general cloning techniques. 

[0031] The IA2 gene and the PPINS gene 5' -oligo contain half of the polylysine-arginine-encoding sequence with a 
Not I or Sgf I site for coupling to GAD65 and the IA2 gene 3'-end, respectively. The 3' -oligo of PPINS has a histidine 
hexapeptide-encoding sequence to enable antibody recognition and metal chelate chromatography purification and/or 
immobilization if necessary (Mauch et al. 1993). 

<5 [0032] Purified, restriction enzyme-treated PCR fragments are cloned in a FastBac derivative and E.coli DHIOBac 
cells are transfected with the plasmid. Recombinant clones are selected and DNA isolated and transfected into Sf9 
insect cells. 

[0033] Virus-producing cells are cultivated and stock virus made. Large-scale cultures are used to produce recom- 
binant single proteins and the polyprotein. 
so [0034] SDS-PAGE/Western analysis is used to analyse size and immunoreactivity of the recombinant polyproteins. 
The proteins are blotted onto a nitrocellulose or nylon membrane and GAD/IA2/PPINS antibodies used to detect the 
product visualised with enhanced chemiluminescence, ECL 

[0035] For purification of the polyprotein GAD65-specif ic monoclonal antibody (GAD6, Developmental Studies Hybri- 
doma Bank, Iowa University) is immobilized to Sepharose 4B activated with cyanogen bromide (Pharmacia, Uppsala, 
55 Sweden). Elution of the protein is performed at low pH (3-4) and solubility is achieved by adding detergents (e.g. Non- 
idet or Tween) to allow dissociation from for example residual cell debris. Alternatively, M1 antibody (ATCC cell line no. 
HB 9259) recognising the N-terminal flag epitope is coupled to Sepharose and the single proteins and the polyproteins 
are bound in the presence of calcium ions and elution is achieved via calcium depletion. 
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[0036] The steps from cloning to large scale production can be described in more detail as follows: 

1 . Cloning into the pK503-9 vector (Kari Keinanen VTT Finland), a derivative of pFastBac (Gibco BRL Paisley Scot- 
land) of GAD65, or IA2 or PPINS gene, each containing a flag recognition signal (FLAG R , Immunex Corporation) 
5 for antibody detection and a signal peptide for ecdysone glucotransf erase (EGT) for transport into the endoplas- 

matic reticulum for removal of the signal peptide with simultaneous release of N-terminal aspartate for Ml antibody 
recognition. The constructs contain each a X-X-G-S-H-H-H-H-H-H carboxyterminal peptide (SEQ ID NO: 11) to 
allow metal chelate affinity purification and detection with specific antibody (Cedarlane, Canada) of the product. 

10 2. Transformation into competent E. coli DHIOBac cells of the plasmids containing the single genes. 

3. Isolation of recombinant Bacmid DNA and transection with the fused DNA of the Sf9 or Hi-5 insect cells. 

4. Production of recombinant stock virus. 

15 

5. Large scale production of the proteins. 

6. Cloning into pK503-9 vector of a cDNA construct for the fusion protein (FP) comprising GAD 65 (nt 1311 -1755; 
aa 102-585) -I A2(nt 2313-2937; aa 771-979)PPINS (nt 2424-2610 and 3396-3539 (of the genomic DNA sequence, 

20 accession No. V00565); aa 1-1 10) in all alternative orders. 

7. Transformation into competent E. coli DHIOBac cells of the plasmids containing the fusion protein. 

8. Isolation of recombinant Bacmid DNA and transection with the fused DNA of the Sf9 or Hi-5 insect cells. 

25 

9. Production of recombinant stock virus. 

10. Large scale production of the fusion protein. 

30 [0037] In case the baculovirus expression system does not work optimally, alternative systems such as E.coli, yeast, 
or in vitro transcription translation assay (Petersen et al. 1 994) will be used for production of said polypeptides. 
[0038] The present invention relates further to the use of the fusion protein in an immunoassay for the detection of 
several pancreatic beta-cell autoantibodies in IDDM patients and prediabetic sera. The assay may detect patients at 
risk of developing IDDM, i.e. having a pre-IDDM condition. As a multicomponent assay, the method could also be used 

35 to predict the time point of onset of the disease. The methodology which combines epitopes of several islet beta cell 
autoantigens increases the informativity and prediction value of the test aimed at prediction of risk and onset of disease 
in individuals genetically predisposed to IDDM. 

[0039] In the immunoassay according to this invention, a sample of the person's body fluid (e.g. serum) is incubated 
with the fusion protein bound to a solid surface, e.g. a microtitration plate or solid gel beads. The bound autoantigens 

40 are thereafter detected with a labeled reagent The reagents can be the single autoantigens GAD65, IA2 and PPINS; 
or proteins comprising epitopes thereof. These reagents are used to detect free antigen-binding regions (V-regions) on 
the bound autoantibodies. One variant of the method will be used for differential detection of the individual autoantigen 
specificities of the antibody in one assay if individual autoantigens (AAGs) labeled with three different labels are used 
(see Figure 3). Alternatively, when the polyprotein (the fusion protein) is labeled with only one label, it can be used to 

45 reveal the sum of these three reactivities in the sample (Figure 4). The same result is achieved if the single antigens are 
all labeled with the same label. The labeled reagent can further be an anti-human monoclonal antibody. In this case the 
assay can reveal only the sum of the three autoantibodies. 

[0040] The technique which involves use of the label attached to the fusion protein or individual autoantigens circum- 
vents several problems encountered in the conventional assays. First, there is little or no nonspecific binding to the vials 

so due to the fact that the carrier surfaces have already been blocked with the corresponding antigen. Second, the attach- 
ment via a bioaff inity pair such as streptavidin/biotin interaction to the vial and use of a flexible peptide between the indi- 
vidual antigenic epitopes enable free motion and folding of the protein in the solution (Figure 4). 
[0041] The label can be any suitable label. However, according to a preferred embodiment, the label is a lanthanide. 
In case three different labels are used, said labels can be e.g. Eu, Sm. Tb and Dy (Siitari et al. 1990; Hemmila et al. 

55 1993). In such a case the detection is based on time- resolved fluorescence. 

[0042] The free labeled reagent can be removed after the incubation step before the signal is quantified (heterogene- 
ous assay), or the signal can be quantified without foregoing removal of the free labelled reagent (homogeneous 
assay). 
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[0043] The procedures are preferably automatized. Automatization of the procedures involves laboratory robots which 
apply samples onto cover slips and the fluorescence is detected in an micro array system in an appropriate unit (Wallac 
OY. Finland). 

[0044] The simultaneous detection of antibodies against the three autoantigens increases the capacity to process 
5 large sample series. The use of a micro array system substantially increases the capacity. This has become necessary 
as nationwide screenings of newborns are undertaken in several research centers. 

[0045] The test principle using time-resolved fluoroimmunoassay (TR-FIA) offers an extremely sensitive means for 
detection of autoantibodies with minimum amount of nonspecific reactivity due to used specific antigen label. The lon- 
gevity of the lanthanide label is also an advantage as compared to radiolabel. 
w [0046] The system allows retaining of important conformational epitopes of the antigen as immobilization of the poly- 
protein is via specific flexible intervening sequences and causes minimal tortion to the antigen. 
[0047] The following illustrates the use of the fusion protein in an immunoassay: 

[0048] To the polyprotein (fusion protein) biotin is bound in limiting conditions to prevent other than the lysine residues 
of the linker peptide to be biotinylated. Streptavidine-coated microscope slides are treated with biotin - fusion protein 

15 and the residual sites are blocked with bovine serum albumin or another suitable binding protein. 

[0049] M1 flag-specific monoclonal antibody will be used to monitor binding onto solid support of free recombinant 
autoantigens while autoantig en -specific monoclonals (e.g. GAD1, GAD6. MICA-3 (Boehringer) etc.) will be used to 
detect availability of specific epitopes. After incubation with sample sera, Eu-labeled GAD65, Sm-laJbeled IA2 and Tb- 
labeled PPINS (produced as a single protein with the baculosystem) are printed robotically onto the microscope slides 

20 in four quadrants covering an area of about 1 cm 2 , allowed to bind, washed and dried in vacuum, and the fluorescence 
is measured on TR f luorometer. 

[0050] The functionality of the method is tested using IDDM sera known to be positive for one or more of the antigens 
used. 

[0051] For specificity testing recombinant GAD65, IA2 and PPINS, or fusion protein are added into patient sample to 
25 preadsorb specific antibodies. 

[0052] The informativity will be compared with conventional systems. Statistical tests will be used to create best pos- 
sible segregation of the positive and negative assay values. 
[0053] The high density array system is fully automatized. 
[0054] The invention is further illustrated by the following examples. 

30 

Example 1 

Labeling procedure 

35 [0055] Isothiocyantophenyl-DTTA-Eu, or Tb, or Sm (Mukkala 1989) will be used for labeling of the FP or the single 
autoantigens. Mainly the protocols of Lovgren & Petterson (1990) and Hemmila et al. (1984) will be followed. 30-100 
fold molar excess of the label substance will be used giving approximately 1 0-1 2 lanthanide molecules per protein mol- 
ecule. For Tb, 500 fold excess will be used. The coupling is carried out for 18 hr at 0 °C in 0.1 M bicarbonate buffer pH 
9.2. The Eu (Tb.Sm)-AAg complex is separated from free Eu (Tb, Sm) by gel filtration in a Sepharose 6B column equil- 

40 ibrated with 0.05 M Tris-HCI buffer pH 7.75 containing 0.9% NaCI and 0.05% NaN 3 . The Eu-AAg complex is stored at 
4 °C. 

Example 2 
45 Immunoassay 

[0056] The assay is performed in the wells of polystyrene microtitration strip coated with unlabeled autoantigen pre- 
parate for 18 hr at 25 °C in 0.1 M bicarbonate buffer pH 9.6 (Siitari & Kurppa 1987). The strips are washed prior to use 
with 0.9% NaCI containing 0.05 % Tween 20 and 0.3% Germal II. To each well 100 ^l of diluted (1:10) serum is added 
so and incubated for 1 hr at 40 °C, washed 2x with the wash solution and 200 nl of the Eu-labeled autoantigen fraction (50 
ng/well) is added. 

[0057] The strips are incubated for 1 hr at 40 °C. The strips are washed 5x with the washing solution. Thereafter 
Enhancement Solution (Wallac) 200 nlTwell is added. Strips are shaken for 10 min in a plate shaker and measured in 
EG&G Wallac Victor f luorometer for Is/specimen. The photons emitted are measured as counts/s. Automated data 
55 reduction program calculates mean value of duplicates and the coefficient of variation (CV%). 

[0058] For future development, the assay formate will be miniaturized e.g. by immobilizing the autoantigen molecules 
onto microparticles (Lovgren et al. 1997) or as a microarray onto glass cover slips. 

[0059] It will be appreciated that the methods of the present invention can be incorporated in the form of a variety of 
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embodiments, only a few of which are disclosed herein. It will be apparent for the specialist in the field that other embod- 
iments exist and do not depart from the spirit of the invention. Thus, the described embodiments are illustrative and 
should not be construed as restrictive. 
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SEQUENCE LISTING 



15 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Hinkkanen, Ari 

(ii) TITLE OF INVENTION: A New Fusion Protein and Its Use in an 
Immunoassay for the Simultaneous Detection of 
Autoantibodies Related to Insulin-Dependent Diabetes 
W Mellitus 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Rothwell, Figg, Ernst & Kurz, P.C. 
<B) STREET: 555 Thirteenth Street N.W., Suite 701-E 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20004 

(v) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS- DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
25 (A) APPLICATION NUMBER: US 09/015,399 

(B) FILING DATE: 29-JAN-1998 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Ihnen, Jeffrey I. 
<B) REGISTRATION NUMBER: 28,957 
30 (C) REFERENCE/DOCKET NUMBER: 2328-111 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-783-6040 

(B) TELEFAX: 202-783-6031 



35 



40 



45 



50 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 9 amino acids 



55 
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10 



15 



20 



30 



40 



(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

{Xi} SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Lys Lys Lys Arg Pro Arg Lys Lys Lys 
15 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: C-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

25 Cys Asn Gly Ser His His His His His His 

15 10 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
35 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Lys Lys Lys Arg Ser Arg Lys Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 amino acids 

(B) TYPE: amino acid 
45 (C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Arg Arg Pro Arg Arg Pro Gly Gly Leu Gly Gly Ser Gly Gly Leu 
15 10 15 
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Arg Leu Leu Leu Cys Leu Leu Leu Leu Ser Ser Arg Pro Gly Gly Cys 
20 25 30 

Ser Ala Val Ser Ala His Gly Cys Leu Phe Asp Arg Arc Leu Cys Ser 
5 35 40 45 

Kis Leu Glu Val Cys lie Gin Asp Gly Leu Phe Gly Gin Cys Gin Val 
50 55 60 

Gly Val Gly Gin Ala Arg Pro Leu Leu Gin Val Thr Ser Pro Val Leu 

10 65 7 0 7 5 8 0 

Gin Arg Leu Gin Gly Val Leu Arg Gin Leu Met Ser Gin Gly Leu Ser 
85 90 95 



IS 



20 



25 



Trp His Asp Asp Leu Thr Gin Tyr Val lie Ser Gin Glu Met Glu Arg 
100 105 110 

lie Pro Arg Leu Arg Pro Pro Glu Pro Arg Pro Arg Asp Arg Ser Gly 
115 120 125 

Leu Ala Pro Lys Arg Pro Gly Pro Ala Gly Glu Leu Leu Leu Gin Asp 
130 " 135 140 

lie Pro Thr Gly Ser Ala Pro Ala Ala Gin His Arg Leu Pro Gin Pro 
145 " 150 155 160 

Pro Val Gly Lys Gly Gly Ala Gly Ala Ser Ser Ser Leu Ser Pro Leu 
165 170 175 

Gin Ala Glu Leu Leu Pro Pro Leu Leu Glu His Leu Leu Leu Pro Pro 
180 185 190 

Gin Pro Pro His Pro Ser Leu Ser Tyr Glu Pro Ala Leu Leu Gin Pro 
195 200 205 

Tyr Leu Phe His Gin Phe Gly Ser Arg Asp Gly Ser Arg Val Ser Glu 
210 215 ' 220 

Gly Ser Pro Gly Met Val Ser Val Gly Pro Leu Pro Lys Ala Glu Ala 
225 230 235 240 

Pro Ala Leu Phe Ser Arg Thr Ala Ser Lys Gly lie Phe Gly Asp His 
245 250 255 

Pro Gly His Ser Tyr Gly Asp Leu Pro Gly Pro Ser Pro A.la Gin Leu 
260 265 270 

40 Phe Gin Asp Ser Gly Leu Leu Tyr Leu Ala Gin Glu Leu Pro Ala Pro 

275 280 285 

Ser Arg Ala Arg Val Pro Arg Leu Pro Glu Gin Gly Ser Ser Ser Arg 
290 295 300 



30 



35 



45 



Ala Glu Asp Ser Pro Glu Gly Tyr Glu Lys Glu Gly Leu Gly Asp Arg 

305 310 315 ' 320 

Gly Glu Lys Pro Ala Ser Pro Ala Val Gin Pro Asp Ala Ala Leu Gin 

325 330 335 

Arg Leu Ala Ala Val Leu Ala Gly Tyr Gly Val Glu Leu Arg Gin Leu 

50 3 40 345 350 

Thr Pro Glu Gin Leu Ser Thr Leu Leu Thr Leu Leu Gin Leu Leu Pro 

355 360 365 



55 
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Lys Gly Ala Gly Arg Asn Pro Gly Gly Val Val Asn Val Gly Ala Asp 
370 ' 375 380 

lie Lys Lys Thr Met Glu Gly Pro Val Glu Gly Arg Asp Thr Ala Glu 
5 385 390 395 400 

Leu Pro Ala Arg Thr Ser Pro Met Pro Gly His Pro Thr Ala Ser Pro 
405 410 415 

Thr Ser Ser Glu Val Gin Gin Val Pro Ser Pro Val Ser Ser Glu Pro 
10 42 0 425 430 

Pro Lys Ala Ala Aro Pro Pro Val Thr Pro Val Leu Leu Glu Lys Lys 
435 440 445 

Ser Pro Leu Gly Gin Ser Gin Pro Thr Val Ala Gly Gin Pro Ser Ala 
15 450 455 460 

Arg Pro Ala Ala Glu Glu Tyr Gly Tyr lie Val Thr Asp Gin Lys Pro 
465 470 475 480 



20 



25 



30 



35 



Leu Ser Leu Ala Ala Gly Val Lys Leu Leu Glu lie Leu Ala Glu His 
485 490 495 

Val His Met Ser Ser Gly Ser Phe lie Asn He Ser Val Val Gly Pro 
500 505 510 

Ala Leu Thr Phe Arg He Arg His Asn Glu Gin Asn Leu Ser Leu Ala 
515 520 525 

Asp Val Thr Gin Gin Ala Gly Leu Val Lys Ser Glu Leu Glu Ala Gin 
530 535 540 

Thr Gly Leu Gin He Leu Gin Thr Gly Val Gly Gin Arg Glu Glu Ala 
545 550 555 560 

Ala Ala Val Leu Pro Gin Thr Ala His Ser Thr Ser Pro Met Arg Ser 
565 570 575 

Val Leu Leu Thr Leu Val Ala Leu Ala Gly Val Ala Gly Leu Leu Val 
580 585 590 

Ala Leu Ala Val Ala Leu Cys Val Arg Gin His Ala Arg Gin Gin Asp 
595 600 605 

Lys Glu Aro Leu Ala Ala Leu Gly Pro Glu Gly Ala His Gly Asp Thr 
610 615 620 

40 Thr Phe Glu Tyr Gin Asp Leu Cys Arg Gin His Met Ala Thr Lys Ser 

625 630 635 640 

Leu Phe Asn Arg Ala Glu Gly Pro Pro Glu Pro Ser Arg Val Ser Ser 
645 650 655 

45 Val Ser Ser Gin Phe Ser Asp Ala Ala Gin Ala Ser Pro Ser Ser His 

660 665 670 

Ser Ser Thr Pro Ser Trp Cys Glu Glu Pre Ala Gin Ala Asn Met Asp 
675 680 685 

He Ser Thr Gly His Met He Leu Ala Tyr Met Glu Asp His Leu Arg 
50 690 695 "700 

Asn Arg Asp Arg Leu Ala Lys Glu Trp Glr. Ala Leu Cys Ala Tyr Gin 
705 710 715 720 
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Ala Glu Pro Asn Thr Cys Ala Thr Ala Gin Gly Glu Gly Asn lie Lys 
725 730 "735 

Lys Asn Arg His Pro Asp Phe Leu Pro Tyr Asp His Ala Arg lie Lys 
5 740 "745 750 

Leu Lys Val Glu Ser Ser Pro Ser Arg Ser Asp Tyr lie Asn Ala Ser 
755 ^60 "765 

Pro He He Glu His Asp Pro Arg Met Pro Ala Tyr He Ala Thr Gin 
10 770 775 780 

Gly Pro Leu Ser His Thr He Ala Asp Phe Trp Gin Met Val Trp Glu 
785 ^90 795 BOO 



15 



20 



25 



30 



Ser Gly Cys Thr Val He Val Met Leu Thr Pro Leu Val Glu Asp Gly 
805 810 815 

Val Lys Gin Cys Asp Arg Tyr Trp Pro Asp Glu Gly Ala Ser Leu Tyr 
620 825 830 

His Val Tyr Glu Val Asn Leu Val Ser Glu His He Trp Cys Glu Asp 
835 840 845 

Phe Leu Val Arg Ser Phe Tyr Leu Lys Asn Val Gin Thr Gin Glu Thr 
850 855 860 

Arg Thr Leu Thr Gin Phe His Phe Leu Ser Trp Pro Ala Glu Gly Thr 
865 870 875 880 

Pro Ala Ser Thr Arg Pro Leu Leu Asp Phe Arg Arg Lys Val Asn Lys 
885 890 895 

Cys Tyr Arg Gly Arg Ser Cys Pro He He Val His Cys Ser Asp Gly 
900 90 5 910 

Ala Gly Arg Thr Gly Thr Tyr He Leu He Asp Met Val Leu Asn Arg 
915 920 925 

Met Ala Lys Gly Val Lys Glu He Asp He Ala Ala Thr Leu Glu His 
930 935 940 

Val Arg Asp Gin Arg Pro Gly Leu Val Arg Ser Lys Asp Gin Phe Glu 
945 950 955 960 

Phe Ala Leu Thr Ala Val Ala Glu Glu Val Asn Ala He Leu Lys Ala 
965 970 975 

40 Leu Pro Gin 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Ala Ser Pro Gly Ser Gly Phe Trp Ser Phe Gly Ser Glu Asp Gly 
1 5 10 15 

55 
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10 



15 



20 



25 



30 



35 



Ser Gly Asp Ser Glu Asn Pro Gly Thr Ala Arg Ala Trp Cys Gin Val 
20 25 30 

Ala Gin Lys Phe Thr Gly Gly He Gly Asn Lys Leu Cys Ala Leu Leu 
35 <0 <5 

Tyr Gly Asp Ala Glu Lys Pro Ala Glu Ser Gly Gly Ser Gin Pro Pro 
50 55 60 

Arq Ala Ala Ala Arg Lys Ala Ala Cys Ala Cys Asp Gin Lys Pro Cys 
65 70 75 80 

Ser Cys Ser Lys Val Asp Val Asn Tyr Ala Phe Leu His Ala Thr Asp 
85 90 95 

Leu Leu Pro Ala Cys Asp Gly Glu Arg Pro Thr Leu Ala Phe Leu Gin 
100 105 HO 

Asp Val Met Asn He Leu Leu Gin Tyr Val Val Lys Ser Phe Asp Arg 
115 120 125 

Ser Thr Lys Val He Asp Phe His Tyr Pro Asn Glu Leu Leu Gin Glu 
130 135 140 

Tyr Asn Trp Glu Leu Ala Asp Gin Pro Gin Asn Leu Giu Glu He Leu 
145 150 155 160 

Met His Cys Gin Thr Thr Leu Lys Tyr Ala He Lys Thr Gly His Pro 
165 170 175 

Arg Tyr Phe Asn Gin Leu Ser Thr Gly Leu Asp Met Val Gly Leu Ala 
180 185 190 

Ala Asp Trp Leu Thr Ser Thr A.la Asn Thr Asn Met ?he Thr Tyr Glu 
195 200 205 

He Ala Pro Val Phe Val Leu Leu Glu Tyr Val Thr Leu Lys Lys Met 
210 215 220 

Arg Glu He lie Gly Trp Pro Gly Gly Ser Gly Asp Gly He Phe Ser 
225 230 235 240 

Pro Gly Gly Ala He Ser Asn Met Tyr Ala Met Met He Ala Arg Phe 
245 250 255 



Lys Met Phe Pro Glu Val Lys Glu Lys Gly Met Ala Ala Leu Pro Arg 
260 265 270 



40 



Leu He Ala Phe Thr Ser Glu His Ser His Phe Ser Leu Lys Lys Gly 
275 280 285 



45 



50 



Ala Ala Ala Leu Gly He Gly Thr Asp Ser Val He Leu He Lys Cys 
290 295 300 

Asp Glu Arg Gly Lys Met He Pro Ser Asp Leu Glu Arg Arg He Leu 
305 ' 310 315 320 

Glu Ala Lys Gin Lys Gly Phe Val Pro Phe Leu Val Ser Ala Thr Ala 
325 330 335 

Gly Thr Thr Val Tyr Gly Ala Phe Asp Pro Leu Leu Ala Val Ala Asp 
340 345 350 

He Cys Lys Lys Tyr Lys He Trp Met His Val As? Ala Ala Trp Gly 
355 360 365 
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Gly Gly Leu Leu Met Ser Arg Lys His Lys Trp Lys Leu Ser Gly Val 
370 375 380 

Glu Arg Ala Asn Ser Vai Thr Trp Asn Pro Kis Lys Met Met Gly Val 
385 390 395 400 

Pro Leu Gin Cys Ser Ala Leu Leu Val Arg Glu Glu Gly Leu Met Gin 
405 410 415 

Asn Cys Asn Gin Met His Ala Ser Tyr Leu Phe Gin Gin Asp Lys His 
420 425 430 

Tyr Asp Leu Ser Tyr Asp Thr Gly Asp Lys Ala Leu Gin Cys Gly Arg 
435 440 445 

His Val Asp Val Phe Lys Leu Trp Leu Met Trp Arg Ala Lys Gly Thr 
15 450 455 460 

Thr Gly Phe Glu Ala His Val Asp Lys Cys Leu Glu Leu Ala Glu Tyr 
465 470 475 480 

Leu Tyr Asn lie He Lys Asn Arg Glu Gly Tyr Glu Met Val Phe Asp 
*° 485 490 495 

Gly Lys Pro Gin His Thr Asn Vai Cys Phe Trp Tyr He Pro Pro Ser 
500 505 510 

Leu Arg Thr Leu Glu Asp Asn Glu Glu Arg Met Ser Arg Leu Ser Lys 
25 515 520 525 

Val Ala Pro Val He Lys Ala Arg Met Met Glu Tyr Gly Thr Thr Met 
530 535 540 



30 



35 



40 



Val Ser Tyr Gin Pro Leu Gly Asp Lys Val Asn Phe Phe Arg Met Val 
545 550 555 560 

He Ser Asn Pro Ala Ala Thr His Gin Asp He Asp Phe Leu He Giu 
565 570 575 

Glu He Glu Arg Leu Gly Gin Asp Leu 
580 585 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Ala Leu Trp Met Arg Leu Leu Pro Leu Leu Ala Leu Leu Ala Leu 
1 5 10 15 

Trp Gly Pro Asp Pro Ala Ala Ala Phe Val Asn Gin His Leu Cys Gly 
50 * ' 20 25 30 

Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe 
35 40 45 
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Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala Glu Asp Leu Gin Val Gly 
50 ' 55 60 

Gin Val Glu Leu Gly Gly Gly Pro Gly Ala Gly Ser Leu Gin Pro Leu 
5 65 70 75 80 

Ala Leu Glu Gly Ser Leu Gin Lys Arg Gly lie Val Glu Gin Cys Cys 
65 90 95 

Thr Ser lie Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 
10 100 105 110 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2457 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



25 



35 



40 



45 



ACCCGCCCTC 


GCCGCTCGGC 


CCCGCGCGTC 


CCCGCGCGTG 


CCCTCCTCCC 


GCCACACGGC 


60 


ACGCACGCGC 


GCGCAGGGCC 


AAGCCGAGGC 


AGCCGCCCGC 


AGCTCGCACT 


CGCTGGCGAC 


120 


CTGCTCCAGT 


CTCCAAAGCC 


GATGGCATCT 


CCGGGCTCTG 


GCTTTTGGTC 


TTTCGGGTCG 


180 


GAAGATGGCT 


CTGGGGATTC 


CGAGAATCCC 


GGCACAGCGC 


GAGCCTGGTG 


CCAAGTGGCT 


240 


CAGAAGTTCA 


CGGGCGGCAT 


CGGAAACAAA 


CTGTGCGCCC 


TGCTCTACGG 


AGACGCCGAG 


300 


AAGCCGGCGG 


AGAGCGGCGG 


GAGCCAACCC 


CCGCGGGCCG 


CCGCCCGGAA 


C-GCCGCCTGC 


360 


GCCTGCGACC 


AGAAGCCCTG 


CAGCTGCTCC 


AAAGTGGATG 


TCAACTACGC 


GTTTCTCCAT 


420 


GCAACAGACC 


TGCTGCCGGC 


GTGTGATGGA 


GAAAGGCCCA 


CTTTGGCGTT 


TCTGCAAGAT 


480 


GTTATGAACA 


TTTTACTTCA 


GTATGTGGTG 


AAAAGTTTCG 


ATAGATCAAC 


CAAAGTGATT 


54 0 


GATTTCCATT 


ATCCTAATGA 


GCTTCTCCAA 


GAATATAATT 


GGGAATTGGC 


AGACCAACCA 


600 


CAAAATTTGG 


AGGAAATTTT 


GATGCATTGC 


CAAACAACTC 


TAAAATATGC 


AATTAAAACA 


660 


GGGCATCCTA 


GATACTTCAA 


TCAACTTTCT 


ACTGGTTTGG 


ATATGGTTGG 


ATTAGCAGCA 


720 


GACTGGCTGA 


CATCAACAGC 


AAATACTAAC 


ATGTTCACCT 


ATGAAATTGC 


TCCAGTATTT 


780 


GTGCTTTTGG 


AATATGTCAC 


AC TAAAGAAA 


AT GAG AG AAA 


TCATTGGCTG 


GCCAGGGGGC 


840 


TCTGGCGATG 


GGATATTTTC 


TCCCGGTGGC 


GCCATATCTA 


ACATGTATGC 


CATGATGATC 


900 


GCACGCTTTA 


AGATGTTCCC 


AGAAGTCAAG 


GAGAAAGGAA 


TGGCTGCTCT 


TCCCAGGCTC 


960 


ATTGCCTTCA 


CGTCTGAACA 


TAGTCATTTT 


TCTCTCAAGA 


AGGGAGCTGC 


AGCCTTAGGG 


1020 


ATTGGAACAG 


ACAGCGTGAT 


TCTGATTAAA 


TGTGATGAGA 


GAGGGAAAAT 


GATTCCATCT 


1080 


GATCTTGAAA 


GAAGGATTCT 


TGAAGCCAAA 


CAGAAAGGGT 


TTGTTCCTTT 


CCTCGTGAGT 


1140 


GCCACAGCTG 


GAACCACCGT 


GTACGGAGCA 


TTTGACCCCC 


TCTTAGCTGT 


CGCTGACATT 


1200 



i 

I 

55 

I 
j 
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TGCAAAAAGT ATAAGATCTG GATGCATGTG GATGCAGCTT GGGGTGGGGG A7TACTGATG 12 60 

TCCCGAAAAC ACAAGTGGAA ACTGAGTGGC GTGGAGAGGG CCAACTCTG7 GAZGTGGAAT 132 0 

5 CCACACAAGA TGATGGGAGT CCCTTTGCAG TGCTCTGCTC TCCTGGTTAG ASAAGAGGGA 1380 

TTGATGCAGA ATTGCAACCA AATGCATGCC TCCTACCTCT TTCAGCAAGA TAAACATTAT 14 40 

GACCTGTCCT ATGACACTGG AGACAAGGCC TTACAGTGCG GACGCCACGT TGATGTTTTT 15 00 

10 AAACTATGGC TGATGTGGAG GGCAAAGGGG ACTACCGGGT TTGAAGCGCA TG7TGATAAA 15 60 

TGTTTGGAGT TGGCAGAGTA TTTATACAAC AT C AT AAAAA ACCGAGAAGG AT AT GAG AT G 1620 

GTGTTTGATG GGAAGCCTCA GCACACAAAT GTCTGCTTCT GGTACATTCC TCCAAGCTTG 16B0 

15 CGTACTCTGG AAGACAATGA AGAGAGAATG AGTCGCCTCT CGAAGGTGGC TCCAGTGATT 17 4 0 

AAAGCCAGAA TGATGGAGTA TGGAACCACA ATGGTCAGCT ACCAACCCTT GGGAGACAAG 1800 

GTCAATTTCT TCCGCATGGT CATCTCAAAC CCAGCGGCAA CTCACCAAGA CATTGACTTC 1860 

20 CTGATTGAAG AAATAGAACG CCTTGGACAA GATTTATAAT AACCTTGCTC ACCAAGCTGT 1920 

TCCACTTCTC TAGAGAACAT GCCCTCAGCT AAGCCCCCTA CTGAGAAACT TCCTTTGAGA 198 0 

ATTGTGCGAC TTCACAAAAT GCAAGGTGAA CACCACTTTG TCTCTGAGAA CAGACGTTAC 204 0 

CAATTATGGA GTGTCACCAG CTGCCAAAAT CGTAGGTGTT GGCTCTGCTG GTCACTGGAG 2100 

TAGTTGCTAC TCTTCAGAAT ATGGACAAAG AAGGCACAGG TGTAAATATA G T AGCAGG AT 2160 

GAGGAACCTC AAACTGGGTA TCATTTGCAC GTGCTCTTCT GTTCTCAAAT GCTAAATGCA 2220 

AACACTGTGT ATTTATTAGT TAGGTGTGCC AAACTACCGT TCCCAAATTG GTGTTTCTGA 2 280 

AT G AC AT C AA CATTCCCCCA ACATTACTCC ATTACTAAAG ACAGAAAAAA AT AAAAAC AT 23 4 0 

AAAATATACA AACATGTGGC AACCTGTTCT TCCTACCAAA TATAAACTTG TGTATGATCC 2 400 

AAGTATTTTA TCTGTGTTGT CTCTCTAAAC CCAAATAAAT GTGTAAATGT GGACACA 24 57 
(2) INFORMATION FOR SEQ ID NO: 9: 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3613 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

<D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CAGCCCCTCT GGCAGGCTCC CGCCAGCGTC GCTGCGGCTC CGGCCCGGGA GCGAGCGCCC 60 

GGAGCTCGGA AAGATGCGGC GCCCGCGGCG GCCTGGGGGT CTCGGGGGAT CCGGGGGTCT 120 

CCGGCTGCTC CTCTGCCTCC TGCTGCTGAG CAGCCGCCCG GGGGGCTGCA GCGCCGTTAG 180 

TGCCCACGGC TGTCTATTTG ACCGCAGGCT CTGCTCTCAC CTGGAAGTCT GTATTCAGGA 24 0 

TGGCTTGTTT GGGCAGTGCC AGGTGGGAGT GGGGCAGGCC CGGCCCCTTT TGCAAGTCAC 300 
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20 


GCCAGAGbAA 


GbbAb-AbbA 


bbbbbbbAbA 


bbAbl bbbbA 


bAbbbb 1 Al b 


AbAAb b AAb b 


1 n 0 r\ 




ACTAGGGGAT 


bGTGbAbAGA 


AbCbTGbTTL 


bbbAGCTbTG 


bAbbbAbATb 


bbbb 1 CTGCA 


1060 




GAGGCTGGCC 


GCTGTGCTGG 


CGGGCTATGG 


GGTAGAGCTG 


CGTCAGCTGA 


CbCCTGAGCA 


114 0 


25 


GCTCTCCACA 


CTCCTGACCC 


TGCTGCAGCT 


ACTGCCCAAG 


GGTGCAGGAA 


GAAATCCGGG 


1200 


AGGGGTTGTA 


AATGTTGGAG 


b T GAT AT b AA 


G AAAAL AAT G 


GAGGGGCCGG 


T G GAGGG C AG 


12 60 




AGACACAGCA 


GAGCTTCCAG 


CCCGCACATC 


CCGCATGCCT 


GGACACCCCA 


CTGCCAGCCC 


1320 




TACCTCCAG i 


GAAGTCbAGb 


AGGTGbbAAG 


CCCTGTCTCC 


TCTGAGCCTC 


CCAAAGCTGC 


1380 


30 


CAGACCCCCT 


GTGACACCTG 


TCCTGCTAGA 


GAAGAAAAGC 


CCACTGGGCC 


AGAGCCAGCC 


14 40 




CACGGTGGCA 


GGACAGCCCT 


CAGCCCGCCC 


AGCAGCAGAG 


GAATATGGCT 


ACATCGTCAC 


1500 




TGATCAGAAG 


CCCCTGAGCC 


TGGCTGCAGG 


AGTGAAGCTG 


CTGGAGATCC 


TGGCTGAGCA 


1560 


35 


TGTGCACATG 


TCCTCAGGCA 


GCTTCATCAA 


CATCAGTGTG 


GTGGGACCAG 


CCCTCACCTT 


1620 




CCGCATCCGG 


CACAATGAGC 


AGAACCTGTC 


TTTGGCTGAT 


GTGACCCAAC 


AAGCAGGGCT 


1680 




GGTGAAGTCT 


GAACTGGAAG 


CACAGACAGG 


GCTCCAAATC 


TTGCAGACAG 


GAGTGGGACA 


1740 


40 


GAGGGAGGAG 


GCAGCTGCAG 


TCCTTCCCCA 


AACTGCGCAC 


AGCACCTCAC 


CCATGCGCTC 


1800 




AGTGCTGCTC 


ACTCTGGTGG 


CCCTGGCAGG 


TGTGGCTGGG 


CTGCTGGTGG 


CTCTGGCTGT 


1860 




GGCTCTGTGT 


GTGCGGCAGC 


ATGCGCGGCA 


GCAAGACAAG 


GAGCGCCTGG 


CAGCCCTGGG 


1920 


45 


GCCTGAGGGG 


GCCCATGGTG 


ACACTACCTT 


TGAGTACCAG 


GACCTGTGCC 


GCCAGCACAT 


1980 


GGCCACGAAG 


TCCTTGTTCA 


ACCGGGCAGA 


GGGTCCACCG 


GAGCCTTCAC 


GGGTGAGCAG 


2040 




TGTGTCCTCC 


CAGTTCAGCG 


ACGCAGCCCA 


GGCCAGCCCC 


AGCTCCCACA 


GCAGCACCCC 


2100 


50 


GTCCTGGTGC 


GAGGAGCCGG 


CCCAAGCCAA 


CATGGACATC 


TCCACGGGAC 


ACATGATTCT 


2160 


GGCATACATG 


GAGGATCACC 


TGCGGAACCG 


GGACCGCCTT 


GCCAAGGAGT 


GGCAGGCCCT 


2220 




CTGTGCCTAC 


CAAGCAGAGC 


CAAACACCTG 


TGCCACCGCG 


CAGGGGGAGG 


GCAACATCAA 


22eo 
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5 



15 



20 



25 



30 



35 



AAAGAACCGG 


CATCCTGACT 


TCCTGCCCTA 


TGACCATGCC 


CGCATAAAAC 


TGAAGGTGGA 


2 34 0 


GAGCAGCCCT 


TCTCGGAGCG 


ATTACATCAA 


CGCCAGCCCC 


ATTATTGAGC 


ATGACCCTCG 


24 00 


GATGCCAGCC 


TACATAGCCA 


CGCAGGGCCC 


GCTGTCCCAT 


ACCATCGCAG 


ACTTCTGGCA 


7 a fin 

£. H D U 


KATGGTGTGG 


GAGAGCGGCT 


GCACCGTCAT 


CGTCATGCTG 


ACCCCGCTGG 


TGGAGGATnn: 


£ V 4 U 


T G T CAAGCAG 


TGTGACCGCT 


ACTGGCCAGA 


TGAGGGTGCC 

A \J *V>J w W A \J W W 


TCCCTCTACC 


ACGTATATG A 




GGTGAACCTG 


GTGTCGGAGC 


ACATCTGGTG 


CGAGGACTTT 


CTGGTGCGGA 


GCTTCTACCT 

ww a 4 w 2 rv ww x 




GAAGAACGTG 


CAGACCCAGG 


AGACGCGCAC 


GCTCACGCAG 


TTCCACTTCC 


TCAGCTGGCC 

a vnvj w x w w w w- 


p 7 nn 


GGCAGAGGGC 


ACACCGGCCT 


CCACGCGGCC 


CCTGCTGGAC 


TTCCGCAGGA 


AGGTGAACAA 


77 fin 


GTGCTACCGG 


GGCCGCTCCT 


GCCCCATCAT 


CGTGCACTGC 


AGTGATGGTG 


CGGGGAGGAC 


<C D A U 


CGGCACCTAC 


ATCCTCATCG 


ACATGGTCCT 


GAACCGCATG 


GCAAAAGGAG 


TGAAGG An AT 


9 r fi n 


TGACATCGCT 


GCCACCCTGG 


AGCATGTCCG 


TGACCAGCGG 


CCTGGCCTTG 


TrfGrTrTia 




GGACCAGTTT 


GAATTTGCCC 


TGACAGCCGT 


GGCGGAGGAA 


GTGAATGCCA 


TrrTrt&GGr 


nnn 


CCTGCCCCAG 


TGAGACCCTG 


GGGCCCCTTG 


GCGGGCAGrr 


Wrt w W W J. W 1 O 1 


rrrTrTTTrf 


i n e a 


CTGTGTGAGC 


ATCTCTGTGT 


ACCCACTCCT 

n^uvn^ x ww a 






L. 1 i (jtalal^/^ i la 


lion 


CTCAGCCCTT 


CCTAGAAGAG 


TCAGGAAGGG 


AAA^rfA^AA 






Jl o 0 


CGCATGCCAG 


AGCCTGGGGr 




LtnuuuUn 1 w 


w wM. 1 bobob 1 




32 4 0 


GGAGGAGAGG 


AAAGGACATG 


GGTAGCAATT 


CT AC CCAGAG 


CCTTCTCCTG 


CCTACATTCC 


3300 


CTGGCCTGGC 


TCTCCTGTAG 


CTCTCCTGGG 


GTTCTGGGAG 


TTCCCTGAAC 


ATCTGTGTGT 


3360 


GTCCCCCTAT 


GCTCCAGTAT 


GGAAGAATGG 


GGTGGAGGGT 


CGCCACACCC 


GGCTCCCCCT 


3420 


GCTTCTCAGC 


CCCGGGCCTG 


CCTCTGACTC 


ACACTTGGGC 


GCTCTGCCCT 


CCCTGGCCTC 


3480 


ACGCCCAGCC 


TGGTCCCACC 


ACCCTCCCAC 


CATGCGCTGC 


TCAACCTCTC 


TCCTTCTGGC 


3540 


GCAAGAGAAC 


ATTTCTAGAA 


AAAACTACTT 


TTGTACCAGT 


GTGAATAAAG 


TTAGTGTGTT 


3600 


GTCTGTGCAG 


CTG 










3613 



(2) INFORMATION FOR SEQ ID NO: 10: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE DNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTCGAGGGGC CTAGACATTG CCCTCCAGAG AGAGCACCCA ACACCCTCCA GGCTTGACCG 60 
50 GCCAGGGTGT CCCCTTCCTA CCTTGGAGAG AGCAGCCCCA GGGCATCCTG CAGGGGGTGC 120 

TGGGACACCA GCTGGCCTTC AAGGTCTCTG CCTCCCTCCA GCCACCCCAC TACACGCTGC 180 

55 
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TGGGATCCTG 


GA * CTCAGGT 




r a atactic; r* 


a a arTcrTAC 


TCATCCACGA 


24 0 


AGGCCCTCCT 


GGGCATGn i G 


o ILL 1 ii.L^n 




rTft^TfTTCA 


CACACCTTGT 


300 


TAGTGCCCAG 


CCCCTGAGGT 


TGCAGG I GGG 




O O ^ 1 \J X w 




fin 


AAGCCCTGGG 


GAAGTGCCTG 


CCTTGCCTCL 


GCGGGGCCG I 




OVJk. Ibl Ov.b 


/on 

■S u 


TCCTACCTGG 


GCTCCCCCCA 


TCCAGCCTCC 


CTCCGTACAL 




nbbnbbLnLL 


A ft A 
SOU 


CATGTCCTCT 


CCAGCTGCCG 


GGCCTCAGAG 


CACTGTGGCG 




G*-GAGv,GGA i. 




GTCCTGCTGT 


GGCATGGCTC 


AGGGTGGAAA 


GGGCGGAAGG 


GAGGGG I CG 1 


GG AGA l AGG J. 


buy 


GGTGCCCACT 


ACCAAACCCG 


CTCGGGGCAG 


G AGAG C C AAA 


GGG I GGG Ibl 


G 1 GGAGAGGG 


DOU 


GCCCCGAGAG 


GTTCCGAGGC 


TGAGGCCAGG 


GTGgGACATA 


GGGA1 bLbrtb 


GGGGGGGGGG 


f z u 


AC AGGAT AC T 


CCAACCTGCC 


TGGCCCCATG 


GTCTCATCG T 




IjljALL I Lb 1 G 


Ton 


ATCCTGCCCC 


TGGTGCTAAG 


AGGCAGGTAA 


GGGGCTGGAG 


G AG t, A G G G C 




c ^ n 


TGCCCCCTCA 


CCATGGGTCA 


GGCTGGACCT 


CCAGGTGCCT 


GTTCTGGGGA 


GG * GGGAGGG 


q n o 

y u u 


CCGGAGGGGT 


GTACCCCAGG 


GGCTCAGCCC 


AGATGACAC * 


ATGGGGG I GA 


1 GG 1 G I GA 1 G 


y bu 


GGACCTGGCC 


AGGAGAGGGG 


AGATGGGCTC 


CCAGAAGAGG 


AGTGGGGGGT 


GA.GAGGGTGC 




CTGGGGGGCC 


AGGACGGAGC 


TGGGCCAGTG 


CACAGCTTCC 


LALALL TGCC 


CACGCCGAGA 


1 U o U 


GTCCTGCCGC 


CACCCCCAGA 


TCACACGGAA 


GATGAGGTCC 


GAGTGGCCTG 


C i GAGGACTT 


114 0 


GCTGCTTGTC 


CCCAGGTCCC 


CAGGTCATGC 


CCTCCTTCTG 


CCACCCiGGG 


GAGCTG AG G G 


1200 


CCTCAGCTGG 


GGCTGCTGTC 


CTAAGGCAGG 


GTGGGAACTA 


GGCAGCCAGC 


AGGGAGGGGA 


1260 


CCCCTCCCTC 


ACTCCCACTC 


TCCCACCCCC 


ACCACCTTGG 


CCCATCCATG 


GCGGCATCTT 


1320 


GGGCCATCCG 


GGACTGGGGA 


CAGGGGTCCT 


GGGGACAGGG 


GTCCGGGGAC 


AGGGTCCTGG 


1380 


GGACAGGGGT 


GTGGGGACAG 


GGGTCTGGGG 


ACAGGGGTGT 


GGGGACAGGG 


GTGTGGGGAC 


1440 


AGGGGTCTGG 


GGACAGGGGT 


GTGGGGACAG 


GGGTCCGGGG 


ACAGGGGTGT 


GGGGACAGGG 


1500 


GTCTGGGGAC 


AGGGGTGTGG 


GGACAGGGGT 


GTGGGGACAG 


GGGTCTGGGG 


ACAGGGGTGT 


1 560 


GGGGACAGGG 


GTCCTGGGGA 


CAGGGGTGTG 


GGGACAGGGG 


TGTGGGGACA 


GGGGTGTGGG 


1620 


GACAGGGGTG 


TGGGGACAGG 


GGTCCTGGGG 


ATAGGGGTGT 


GGGGACAGGG 


GTGTGGGGAC 


1680 


AGGGGTCCCG 


GGGACAGGGG 


TGTGGGGACA 


GGGGTGTGGG 


GACAGGGGTC 


CTGGGGACAG 


1740 


GGGTCTGAGG 


ACAGGGGTGT 


GGGCACAGGG 


GTCCTGGGGA 


CAGGGGTCCT 


GGGGACAGGG 


1800 


GTCCTGGGGA 


CAGGGGTCTG 


GGGACAGCAG 


CGCAAAGAGC 


CCCGCCCTGC 


AGCCTCCAGC 


1860 


TCTCCTGGTC 


TAATGTGGAA 


AGTGGCCCAG 


GTGAGGGCTT 


TGCTCTCCTG 


GAGACATTTG 


1920 


CCCCCAGCTG 


TGAGCAGGGA 


CAGGTCTGGC 


CACCGGGCCC 


CTGGTTAAGA 


CTCTAATGAC 


1980 


CCGCTGGTCC 


TGAGGAAGAG 


GTGCTGACGA 


CCAAGGAGAT 


CTTCCCACAG 


ACCCAGCACC 


2040 


AGGGAAATGG 


TCCGGAAATT 


GCAGCCTCAG 


CCCCCAGCCA 


TCTGCCGACC 


CCCCCACCCC 


2100 


GCCCTAATGG 


GCCAGGCGGC 


AGGGGTTGAC 


AGGTAGGGGA 


GATGGGCTCT 


GAGACTATAA 


2160 
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ZAAGCAGGTC 


TGTTCCAAGG 


GCCTTTGCGT 


CAGGTGGGCT 


CAGGGTTCCA 


G G<oT G GCTG G 


2280 




ACCCCAGGCC 


CCAGCTCTGC 


AGCAGGGAGG 


ACGTGGCTGG 


GCTCGTGAAG 


CATGTGGGGG 


2340 


5 


TG AG C C CAG G 


GGCCCCAAGG 


CAGGGCACCT 


GGCCTTCAGC 


CTGCCTCAGC 


CCTGCCTGTC 


2400 






CTGTCCTTCT 

V_* A V> A <w s_ A A >^ A 


GCCATGGCCC 


TGTGGATGCG 


CCTCCTGCCC 


C7GCTGGCGC 


2460 




Tf^CTGGCCCT 

X Ob* 1 ^J — . ^_ 1 


CTGGGGACCT 


GACCCAGCCG 


CAGCCTTTGT 


GAACCAACAC 


CTGTGCGGCT 


2520 


10 


LnUn^L 1 oo i 


GGAAGCTCTC 


TACCTAGTGT 


GCGGGGAACG 


AGGCTTCTTC 


TACACACCCA 


2580 






GGAGGCAGAG 

O /»v7 vj j»\jv» vj 


GACCTGCAGG 


GTGAGCCAAC 


CGCCCATTGC 


TGCCCCTGGC 


2640 




V* la b» b v- o 




TCCTGGCGCT 


CCCACCCAGC 


ATGGGCAGAA 


GGGGGCAGGA 


2700 


15 




GAGCAGGGGG 

V^-^l VJ V— /i w\J VJ \J 


TCAGGTGCAC 


TTTTTTAAAA 


AGAAGTTCTC 


TTGGTCACGT 


21 60 




bb. I M/vrirto 1 o 


k-rt«JVi i \_v—v_ 


TGTGGCCCAG 

A VJ A U JVvYjWJiU 


TCAGAATCTC 


AGCCTGAGGA. 


CGGTGTTGGC 


2820 






CCG AG ATACA 


TCAGAGGGTG 


GGCACGCTCC 


TCCCTCCACT 


CGCCCCTCAA 


2880 


20 




CGCAGCCCAT 

' — VJ ^ livJ w V^ V— -1 


TTCTCCACCC 


TCATTTGATG 


ACCGCAGATT 


CAAGTGTTTT 


2940 




KTTAAGTAAA 


GTCCTGGGTG 

bj A V* V- X VJ VJVJ A *— J 


ACCTGGGGTC 


ACAGGGTGCC 


CCACGCTGCC 


TGCCTCTGGG 


3000 






ATCACGCCCG 


GAGGAGGGCG 


TGGCTGCCTG 


CCTGAGTGGG 


CCAGACCCCT 


3060 


25 


GTCGCCAGCC 


TCACGGCAGC 


TCCATAGTCA 


GGAGATGGGG 


AAGATGCTGG 


GGACAGGCCC 


3120 


TGGGGAGAAG 


TACTGGGATC 


ACCTGTTCAG 


GCTCCCACTG 


TGACGCTGCC 


CCGGGGCGGG 


3180 




GGAAGGAGGT 


GGGACATGTG 

uUurtv.n 1 VJ A VJ 


GGCGTTGGGG 

UVJ V^VJ A A VJVJVJU 


CCTGTAGGTC 


CACACCCAGT 


GTGGGTGACC 


3240 


30 


bi.b.b.blb-1 


rrTGGGTrr a 


GrrrGGGTGG 


AGATGGGTGG 

/^.O-^V 1 JUOi VJ vj 


GAGTGCGACC 


T AGGGGTGGC 

i. /^VJ VJ VJT X VjrVJfL- 


3300 


o o o b» Ao o b Li vj 


blb.rtb. ILllOlL. 


1 ^.ub 1 onb* 1 o 


1 o i b b l bb i o 




b-b* X bbbbbb 1 


J JuU 






b- 1 Ob 1 b, 1 ObO 


bOOb/AbO 1 bb 


TGGCAflTGGG 


GGAGGTf^GAfJ 


b- 1 bo j^bbbb 


O *t £. bf 




ob-b-b. 1 oo 1 


rtb>ob.ribjb»b^ 1 o 


bnbbbbi lbu 


bbb 1 ovjrr-lvjbjo 


o 1 b-b-b* 1 brb^r\b7 


n/\obo 1 bbbrt 




35 


J. Ibi bjo/vftb-rt 




no b>\ lblu^l 


bbb 1 b Inbbn 


ub 1 bbMortnb 


1 rtb 1 obnMb 1 






>io/\b-o b./\o V» b» 


1 (jbrtuuLrtlaL. 


f <** & & r*/* 

LLLnLnLLL o 


b b, bjb,b- 1 bb 1 b 


v.. t\\* w bj rio n Vj 


vo Av 1 bbr.n X f\n 


J DUU 






b.b->ib>bb,b, I uv. 


TrTrrri-TrT 

lul bbbu i b 1 


^lglolbl 1 b 


UbobLbb 1 o vj 


bbbKrtbbbbb 


t <; fin 

JDDv 


40 




Ab^ lb! Jul bft 


ubbbb 1 b-bb/i 


frrTrT r"r f* & 

Ub v^brt 


bub 1 b. 1 b i b?b? 


bt 1 ol_b.b/ib,rtb3 


-J / ^ u 




GTGCCAACGC 


CAGGCAGGCC 


CAGCATGCAG 


TGGCTCTCCC 


CAAAGCGGCC 


ATGCCTGTTG 


3780 




GCTGCCTGCT 


GCCCCCACCC 


TGTGGCTCAG 


GGTCCAGTAT 


GGGAGCTTCG 


GGGGTCTCTG 


3840 


45 


AGGGGCCAGG 


GATGGTGGGG 


CCACTGAGAA 


GTGACTCTGT 


CAGTAGCCGA 


CCTGGAGTCC 


3900 




CCAGAGACCT 


TGTTCAGGAA 


AGGGAATGAG 


AACATTCCAG 


CAATTTTCCC 


CCCACCTAGC 


3960 




CCTCCCAGGT 


TCTATTTTTA 


GAGTTATTTC 


TGATGGAGTC 


CCTGTGGAGG 


GAGGAGGCTG 


4020 


50 


GGCTGAGGGA 


GGGGGTCCTG 


CAGGGCGGGG 


GGCTGGGAAG 


GTGGGGAGAG 


GCTGCCGAGA 


4080 




GCCACCCGCT 


ATCCCCAGCT 


CTGGGCAGCC 


CCGGGACAGT 


CACACACCCT 


GGCCTCGCGG 


4140 
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15 



20 



CCCAAGCTGG CAGCCGTCTG CAGCCACAGC TTATGCCAGC CCAGGT CC7-.Z- CC AGACACCT 4 200 

GAGGGACCCA CTGGTGCCTT GGAGGAAGCA GGAGAGGTCA GATGGCACC-. TGAGCTGGGG 4 2 60 

CAGGTGCAGG GACCGTGGCA GCACCTGGCA GGGCCTCAGA ACCCATGCC7 TGGGCACCCC 4 320 

GGCCATGAGG CCCTGAGGAT TGCAGCCCAA GAGAAGCAGG GAACGCCAG^ GCCACAGGGG 4 380 

CAGAGACCAG GCCAGGGTCC CTTGCGGCCC TTAGCCCACC CCCTCCCAGT AAGCAGGGGC 4 4 40 

TGCTTGGCTA GGCTTCCTTT TGCTACAGAC CTGCTGCTCA CCCAGAGGCC CACGGGCCCT 4 500 

AGTGACAAGG TCGTTGTGGC TCCAGGTCCT TGGGGGTCCT GACACAGAGZ CTCTTCTGCA 4 5 60 

GCACCCCTGA GGACAGGGTG CTCCGCTGGG CACCCAGCCT AGTGGGCAGA CGAGAACCTA 4 620 

GGGGCTGCCT GGGCCTACTG TGGCCTGGGA GGTCAGCGGG TGACCCTAGG TACCCTGTGG 4 680 

CTGGGCCAGT CTGCCTGCCA CCCAGGCCAA ACCAATCTGC ACCTTTCCTG AGAGCTCCAC 4 74 0 

CCAGGGCTGG GCTGGGGATG GCTGGGCCTG GGGCTGGCAT GGGCTGTGGC TGCAGACCAC 4 800 

TGCCAGCTTG GGCCTCGAGG CCAGGAGCTC ACCCTCCAGC TGCCCCGCCT CCAGAGTGGG 4 8 60 

GGCCAGGGCT GGGCAGGCGG GTGGACGGCC GGACACTGGC CCCGGAAGAG GAGGGAGGCG 4 920 

GTGGCTGGGA TCGGCAGCAG CCGTCCATGG GAACACCCAG CCGGCCCCAC TCGCACGGGT 4 98 0 

AGAGACAGGC GC 4 992 
(2) INFORMATION FOR SEQ ID NO: 11: 

{ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: C- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Xaa Xaa Gly Ser His His His His His His 
15 10 

40 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "DNA for bridge peptide" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAGAAGAAGC GGCCGCGAAA GAAGAAG 27 
(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "DMA for bridge peptide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAGAAGAAGC GATCGCGAAA GAAGAAG 2 7 



Claims 

1 . A fusion protein having epitopes of at least two of the autoantigens glutamic acid decarboxylase (GAD65), islet cell 
antigen (IA2) and preproinsulin (PPINS) wherein said epitopes are connected with a linker peptide, said fusion pro- 
tein being able to bind to a solid phase. 

2. The fusion protein according to claim 1 having epitopes of each of the autoantigens GAD65, IA2 and PPINS. 

3. The fusion protein according to claim 2 wherein 

the epitope of IA2 comprises the amino acids 771-979 of the amino acid sequence shown in Figure 2a. 

the epitope of GAD65 comprises the amino acids 1 02-585 of the amino acid sequence shown in Figure 2b, and 

the epitope of PPINS comprises all the amino acids 1 -1 10 of the amino acid sequence shown in Figure 2c. 

4. The fusion protein according to claim 1 wherein the linker peptide comprises lysine and argine residues. 

5. The fusion protein according to claim 4 wherein said linker peptide is provided with a member of an affinity binding 
pair so as to enable the binding of said fusion protein to the solid phase. 

6. The fusion protein according to claim 5 wherein the affinity binding pair is biotin - streptavidin. 

7. A cDNA encoding the fusion protein according to claim 1 wherein said cDNA comprises the nucleotide sequences 
encoding the epitopes of at least two of the autoantigens glutamic acid decarboxylase (GAD65), islet cell antigen 
(IA2) and preproinsulin (PPINS). 

8. A cDNA encoding the fusion protein according to claim 3 wherein said cDNA comprises the nucleotide sequences 

a) nucleotides 131 1 to 1 755 of the sequence according to SEQ ID NO: 8 encoding GAD65, aa 102-585, 

b) nucleotides 2313 to 2937 of the sequence according to SEQ ID NO: 9 encoding IA2, aa 771-979, and 

c) nucleotides 2424 to 2610 and 3397 to 3539 of the sequence according to SEQ ID NO: 10 encoding PPINS. 
aa 1-1 10, where said nucleotide sequences a), b) and c) can appear in any relative order. 

9. A vector comprising the cDNA according to claim 7 or 8. 

10. An E. coli cell encompassing the cDNA according to claim 7. 

11. An immunoassay for the simultaneous determination in a sample of a person's body fluid of at least two insulin 
dependent diabetes mellitus (IDDM) related autoantibodies, wherein each autoantibody is specific for an epitope 
of the autoantigens glutamic acid decarboxylase (GAD65). islet cell antigen (IA2) or preproinsulin (PPINS). said 
immunoassay comprising the steps of 
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incubating said sample with a fusion protein according to claim 1, said fusion protein being bound to a solid 
support, 

adding at least one labeled reagent capable of binding to one or more of said autoantibodies, and 
quantifying the signals from the labels bound to the solid phase. 

12. The immunoassay according to claim 1 1 wherein the labeled reagent is an anti-human monoclonal antibody 

13. The immunoassay according to claim 1 1 wherein the labeled reagent comprises at least two antigens labeled with 
different labels, each antigen being one of the autoantigens GAD65, IA2 or PPINS; or proteins comprising epitopes 
thereof. 

14. The immunoassay according to claim 11 wherein the labeled reagent comprises three antigens labeled with the 
same label, each antigen being one of the autoantigens GAD65, IA2 or PPINS; or proteins comprising epitopes 
thereof. 

1 5. The immunoassay according to claim 1 1 wherein the label is a fluorescent lanthanide chelate. 

16. A method for diagnosing a person's risk of developing insulin dependent diabetes mellitus (IDDM), said method 
comprising the determination in a sample of said person's body fluid of at least two insulin dependent diabetes mel- 
litus (IDDM) related autoantibodies specific for an epitope of the autoantigens glutamic acid decarboxylase 
(GAD65), islet cell antigen (IA2) or preproinsulin (PPINS), wherein the presence of at least two of said autoantibod- 
ies are indicative for said person's risk of developing IDDM. 
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FIG. 4 
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